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ABSTRACT 

k>( , We present measurements of the normalised redshift-space three-point correlation function 

■ (Qz) of galaxies from the Sloan Digital Sky Survey (SDSS) main galaxy sample. These mea- 

surements were possible because of a fast new N-point correlation function algorithm (called 
npt) based on multi-resolutional k-d trees. We have applied npt to both a volume-limited 
(36738 galaxies with 0.05 < z ^ 0.095 and -23 < Mo.o r < -20.5) and magnitude-limited 
sample (134741 galaxies over 0.05 < z sC 0.17 and ~ M* ± 1.5) of SDSS galaxies, and find 
consistent results between the two samples, thus confirming the weak luminosity dependence 
of Q z recently seen by other authors. We compare our results to other Q z measurements in 
the literature and find it to be consistent within the full jack-knife error estimates. However, 
we find these errors are significantly increased by the presence of the "Sloan Great Wall" 
(at z ~ 0.08) within these two SDSS datasets, which changes the 3-point correlation func- 
tion (3PCF) by 70% on large scales (s lOh^ 1 Mpc). If we exclude this supercluster, our 
observed Q z is in better agreement with that obtained from the 2dFGRS by other authors, 
thus demonstrating the sensitivity of these higher-order correlation functions to large-scale 
structures in the Universe. This analysis highlights that the SDSS datasets used here are not 
"fair samples" of the Universe for the estimation of higher-order clustering statistics and 
larger volumes are required. We study the shape-dependence of Q z (s, q, 8) as one expects 
this measurement to depend on scale if the large scale structure in the Universe has grown via 
gravitational instability from Gaussian initial conditions. On small scales (s ^ Qh^ 1 Mpc), 
we see some evidence for shape-dependence in Q z , but at present our measurements are con- 
sistent with a constant within the errors (Q z ~ 0.75 ± 0.05). On scales > 10/i -1 Mpc, we see 
considerable shape-dependence in Q z . However, larger samples are required to improve the 
statistical significance of these measurements on all scales. 

Key words: methods: statistical - surveys - galaxies: statistics - large-scale structure of 
Universe - cosmology: observations 
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1 INTRODUCTION 

Correlation functions are some of the most commonly used statis- 
tics in cosmology. They have a long history in quantifying the clus- 
tering of galaxies in the Universe (see Peebles 1980). There is a 
hierarchy of correlation functions. The two-point correlation func- 
tion (2PCF) compares the number of pairs of data points, as a func- 
tion of separation, with that expected from a Poisson distribution. 
Next in the hierarchy is the 3-point correlation function (3PCF), 
which compares the number of data triplets, as a function of their 
triangular-configuration, to that expected from Poisson. Higher- 
order correlations are defined analogously. 

As discussed by many authors, the higher-order correlation 
functions contain a variety of important cosmological informa- 
tion, which complements that from the 2PCF iGrofh. & Peebles) 
ll977tlBalian"& Schaeffer 1989). These include tests of Gaussian- 
ity and the d eterminati on of galaxy bias as a function of scale l Suto 
1993 t Ijing & BorneJ Il998t iTakada & Jainl 120031 ; Ijing & BorneJ 
2004 iKavo et alj|2004t iLahav & Sutoll2004 . Such tests can also 



be performed using the Fourier -space equivalent of the 3PCF, 
the bi-spectrum <PeebIesl Il980t IScoccimarro et alJ 1 1 9991 1200 it 
IVerdeetaljEool or other statistics suc h as the void pr obabil- 
ity distribution and Minkowski functionalslMeck e et all 19941) . Re- 
cent results from these complementar y statistics using the SDSS 



recursive splitting is stopped when a pre-determined number of data 
points is reached in each node (we used ^ 20 data points herein). 
For our NPCF algorithm, we used an enhanced version of the k-d 
tree technology, namely multi-resolutional k-d trees with cached 
statistics (mrkdtree), which store additional statistical information 
about the search tree, and the data points in each node, e.g., we 
store the total count and centroid of all data in each node. 

The key to our NPCF algorithm is to use multiple mrkdtrees 
together, and store them in main memory of the computer (rather 
than on disk), to represent the required N-point function, e.g., we 
use 3 mrkdtrees to compute the 3PCF, 4 mrkdtrees for the 4PCF, 
and so on. The computational efficiency is increased by pruning 
these trees wherever possible, and by using the cached statistics 
on the tree as much as possible. The details of mrkdtrees and 
our NPCF algorithm (kno wn as npt) have already been outlined 
in several papers iMoore et all200ltlNichol et all2003tlGrav et alJ 
2004). Similar tree-based computational algorithms have been dis- 
cussed by Szapudi et all 1200 ll) . 



3 SDSS DATA 

The details of the SDSS s urvey a re given in a series of tec hnical 
papers by iFukueita et al. I Jl996l); iGunn et al I Jl998l); lYork et al 



cent results trom these complementary statistics using the ^DSS s b Pukugita et al. | fi99dk | G unn et al. I Imi 

main galaxysample include | Hikage et alJ <2002i |2003j, |2005j) and J200(ih : [Hogg et al. I i200lh: IStrauss et alJ <2002h : 

Park et al. (2005). TmCYXl- Pieretal I J?fimlV iRlantnn et al I FfYtthV 



[ 

While the 3PCF is easier to correct for survey edge effects than 
these other statistics, measurements of the 3PCF have been limited 
by the availability of large redshift surveys of galaxies (see Szapudi, 
Meiksin & Nichol 1996, Frieman & Gaztanaga 1999, Szapudi et al. 
2002 for 3PCF analyses of large solid angle catalogues of galaxies) 
and the potentially prohibitive computational time needed to count 
all possible triplets of galaxies (naively, this count scales as 0(N 3 ), 
where N is the number of galaxies in the sample). 

In this paper, we resolve these two problems through the 
application of a new N-point correlation function algorithm 
I Moor e et ali200ll) to the galaxy data of the Sloan Digital Sky Sur- 
vey (SDSS; York et al. 2000). We present herein measurements of 
the 3PCF from the SDSS main galaxy sample. Our measurements 
illustrate the sens itivity of the 3PCF to known large-scale struc- 
tures in the SDS S iGo tt et al] 2005t) . They are complementary to the 
work of IKavo eTalTl2004l) who explicitly explored the luminosity 
and morphological dependence of the 3PCF using SDSS volume- 
limited galaxy samples. These measurements of the 3PCF will help 
facilitate constraints on the biasing of galaxies and will aid in the 
development of theoretical prediction s for the h igher-ord er cor - 
relation functions IScoccimarro et al J l2Q01 : Takada & Jain 2003). 
Throughout this paper, we use the dimensionless Hubble constant 
h = f/o/lOOkms -1 Mpc -1 , the matter density parameter f2 m = 
0.3, and the dimensionless cosmological constant Q,a ~ 0.7, un- 
less stated otherwise. 



2 THE 3PCF COMPUTATIONAL ALGORITHM 

To facilitate the rapid calculation of the higher-order correlation 
functions, we have designed and implemented a new N-point cor- 
relation function (NPCF) algorithm based on k-d trees, which are 
multi-dimensional binary search tree for points in a k-dimensional 
space. The k-d tree is composed of a series of inter-connected 
nodes, which are created by recursively splitting each node along 
its longest dimension, thus creating two smaller child nodes. This 
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1 2002); jpier et al. I J2003h; iBlanton et alJ <2003bl) ; 
(2004); Abazaiian et al. 1 2005). For the computations discussed 
herein, we use two SDSS catalogues. The first is a volume- 
limited sample of 36738 galaxies in the redshift range of 0.05 ^ 
2 ^ 0.095 and absolute magnitude range of —23 ^ Mo o,, ^ 
- 20.5 (for h = 0.7 a nd the z = 0.0 SDSS r filter, or °'°r 
in Blanton et al. 1 2003b) terminology 1 ), covering 2364 deg 2 of 
the SDSS photom etric survey. All the magnitudes were redden- 
ing corrected using Schlegel, Finkbbeiner^&DaviJj lggSl) . and the 
k-corrected vl_16 software ls\m^oi^t ^iK^ ^\K The sec- 
ond sample is the same as "Sample 12" used bv lPope et alJ 1 2004) 
and contains 134741 galaxies over 2406 deg 2 . This latter sample 
is not volume-limited, but is constrained to the absolute magni- 
tude range of -22 < Mo.i r < -19 (or M* ± 1.5 magnitudes) 
for h = 1, and usin g the z = 0.1 SD SS r filter system, or 01 r 
( Blanton et al. 2003b; Zehavi et al. 2005). To compare the two sam- 
ples, our volume-limited sample has the absolute magnitude range 
of -23.54 < Mo.i r sC -21.04 in the same 01 r filter as used for 
the Pope et al. sample; assuming a conversion of ~ °'°r + 0.23 
for the SDSS main galaxy sample with a median color at z = 0.0 of 
(g— r) ~ 0.8. This gives a mean space density of 8.25 x 10 -3 h s 
Mpc -3 , which is comparable to the space den sities of the SDSS 
main galaxy sample given in Table 2 of Ze havi et alJl2005l) . 

We have made no correction for missing galaxies due to fibre- 
collisions (i.e., two SDSS fibres can not be placed closer than 55 
arcseconds on the sky). We do not expect this observational con- 
straint to bias our correlation functions as the adaptive tilting of 
SDSS spectroscopic plates reduces the problem to ~ 7% of pos- 
sible target galaxies being missed (see Blanton et al. 2003a for 
details). Furthermore, this bias will only affect pairs of galaxies 
separated by less than lOO/i -1 kpc, which is significantly smaller 
than the scales studied herein. In each case, we also constructed 



1 lBlantonetai] f2()03b) use redshifted SDSS filters to minimise the effects 
of k-corrections. As discussed in their paper, they propose the use of an 
SDSS filter set redshifted to z = 0.1 for their "rest-frame" quantities. 
These filters are written as 01 u, 01 g, 01 r, i, z 
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Figure 1. Our SDSS measurements of the normalised redshift-space 3PCF as a function of triangle configuration, i.e., Q z (s,u,v). We compare our measure- 
ment with that of Jing & Borner (1998, 2004) for both the 2dFGRS (open circles and error bars) and LCRS (dashed line) galaxy surveys. These two estimates 
of Qz(s, u, v) do not agree because of the different passbands used for these two surveys. We also provide two values of u (namely u = 1.29 and u = 3.04) 
from the Jing & Borner (2004) data, i.e., we have not plotted the u = 2.09 data to avoid over-crowding. The binning has been chosen to be identical to that of 
Jing & Borner (2004). The solid (blue) circles are the SDSS Q z for the volume-limited sample as discussed in Section|3] while the solid (red) star symbols 
are the SDSS Q z for the SDSS magnitude limited sample. The solid error bars shown on these data-points are estimated using jack-knife re-sampling (see 
text), but with sub-regions 3 and 4 in Figurel2*lremoved (i.e., excluding the supercluster from these error bars). For comparison, the dot-dashed error bars on 
the red star symbols are our estimate of the jack-knife errors (the diagonal elements of the covariance matrix) using all 14 sub-regions to estimate the error 
(i.e., the effect of the supercluster is now including in the size of the error bar). In some cases, the error bars are smaller than the plotting symbols. The solid 
(green) triangle symbols are the SDSS Q z for the jack-knife re-sample excluding sub-regions 3 and 4 



catalogues of random data points (containing 8 x 10 5 points) over 
the same ar ea of the sky and with the same selection function as 
discussed in IPope etalj|2004l) . These random catalogues are then 
used to calculate edge effects on the N-point correlation fu nctions 
using the estimators presented in Szapudi & Szalay 1 1998). 
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where S12, S23 and S31 are the three sides of a triangle in red- 
shift space. Then Q(s,u,v) is defined by the ratio of the 3PCF 
C(si2, S23, S31), to sums of products of 2PCFs (e.g. £( s i2)£(,Si3) 
and permutations): 



4 RESULTS 

There are two common parametrizations of Q z . One defines 



,(s,u,v) 



C(si2,S23,S 3 l) 



C(S12)C(S23) + £(S23)£(S31) + £(*3l)C(»l 



(2) 
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Figure 2. Part of the SDSS volume-limited sample denned in Section|3] This redshift slice is approximately 500 by 200 h~ 1 Mpc in the dimensions shown, 
and ~ 100/i _1 Mpc thick (although here we have collapsed the slice in this 3 rd dimension). Most noticeable is the supercluster, which has been called the 
"Sloan Great Wall" by Gott et al. (2005) and is 1 .37 billion light years long. This supercluster is a combination of the Leo A and SCL126 superclusters (Einasto 
et al. 2001), and is associated with tens of known Abell clusters of galaxies. The two regions labelled 3 and 4 are two of the 14 sub-regions used in deriving 
the covariance matrices on our correlation functions as shown as error bars in FigureHI 




s (hr* Mpc) s 12 (h' 1 Mpc) 

Figure 3. (Left) The percentage difference between the 2PCFs for the 14 SDSS jack-knife datasets and the 2PCF as measured for the whole dataset (without 
any sub-regions excluded). The open stars are for the 12 jack-knife datasets with the supercluster shown in Figure|2]included, while solid triangles are for the 
dataset with sub-region 4 excluded (Figure|2| and solid circles are for the dataset with sub— region 3 excluded. (Right) The percentage difference between the 
3PCFs for the 14 SDSS jack-knife datasets and the 3PCF as measured for the whole dataset. The x-axis (S12) is the redshift-space distance for the shortest 
side of the triangle (see Eqn 2). The open stars are for jack-knife datasets with the supercluster shown in Figure|2|included, while solid triangles are for the 
dataset with sub-region 4 excluded (Figure|2| and solid circles are for the dataset with sub-region 3 excluded. All triangle configurations are plotted here, i.e., 
one point per triangle configuration in Figure 1, which explains why there are many data points with the same values of s and S12. 



The second parametrization has Q z (s, q, 6) with s — S12 being the 
shortest side of the redshift-space triangle, q — S23/S12, and 8 the 
angle between these two sides (S12 and S23). 

FigureQshows Q z (s , u, v) for bo th our volume-limited sam- 
ple (filled circles) and the lPope et alJ 120041) sample (filled stars). 
Different panels show results for a range of triangle configura- 
tions. To facilitate a direct comparison with result s from the lit- 
erature, we h ave used the same binning scheme as l.Ting &~B orner 
Jl998tl2004l) . in their analyses of the Las Campanas Redshift Sur- 
vey (LCRS) and 2dF Galaxy Redshift Survey (2dFGRS). The open 



circles show their results. Overall, our Q z {s, u, v) values are con- 
sistent with theirs, but with some obvious disagreements. For ex- 
ample , on large scales (S12 > 10ft _ 1 Mpc), we find larger Q z ~ 1, 
while Ijing & BorneJ 120041) find much smaller values. Although 
the different selection passbands of the 2dFGRS (bj) and SDSS 
(r— band) might account for this difference, it can not account for 
the dis agreement with the LCRS measurements of Jins & Borner 
( 1998) since the LCRS was also r-band selected. 

To quantify the disagreement, we estimated the covariances 
of our 3PCF estimates using the jack-knife re-sampling tech- 
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Figure 4. The SDSS Q z as a function of 0, q and S12 for the Pope et al. sample discussed in the text. We show 3 bins in s\2 (as labeled), while 8 is given in 
radians along the x-axis (bin width of A8 ± 0.05 radians about the central value plotted). The solid (green) circles are for q = 2, solid (red) stars are for q = 3 
and the solid (blue) triangles are for q = 4 (Aq ±0.5 about the central value). We also include a set of data points at the extremes of the 9 range, specifically 
0.0 < 6 < 0.02 and 0.98 < 8 < 1.00. The q = 2 and q = 4 data points are offset by —0.02 and +0.02 radians respectively to reduce overcrowding. 
Likewise, we only plot the full error bars (solid lines) on a fraction of the data points. The dot-dashed error bars have been calculated with sub-regions 3 and 
4 omitted (see Figure 2). 



nique ( discus sed in detail in lScranton et aljj2002l) and lZehavi et alJ 
(2002, 2005)). Briefly, the jack-knife resampling technique pro- 
vides an estimate of the "cosmic variance" within a sample. It is 
calculated by splitting the dataset into sub-regions and then mea- 
suring the variance seen between the estimated correlation func- 
tions as sub-regions are omitted one-by-one (therefore, if there 
are subregions, th ere are N correlatio n function estimates). As 
shown in Figure 2 of lZehavi et alJ i2005l) . the jack-knife errors ac- 
curately reproduce the "true error" (the dispersion measured be- 
tween 100 mock galaxy catalogues), especially for the diagonal 
terms of the covariance matrix of the 2PCF on large scales, (for 
r > 0.5/i _1 Mpc, the difference between the two error estimates 
is always < 10%). In what follows, we assume that the jack-knife 
error estimates are also accurate for the 3PCF. 



The SDSS dataset is built-up of thin "wedge-shaped" regions 
that are 2.5 degrees thick in declination and hundreds of degrees 
wide in right ascension (see York et al. 2000). We divided the total 
volume of our volume limited catalogue up into 14 sub-regions 
when estimating the covariance matrix. These were selected in 
Right Ascension along the SDSS scans. To illustrate, Figure [2] 
shows one of the redshift wedges; two of the sub-regions (namely 
sub-regions 3 and 4) are highlighted to provide an impression of 
the typical size of a subregion, but also because these two particu- 
lar regions will feature prominently in what follows. 

The error bars shown in FigureQshow the diagonal elements 
of the covariance matrices we estimate from the jack-knife method. 
The sizes of these diagonal elements (as well as the off-diagonal 
elements) are extremely sensitive to the inclusion or exclusion of 
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Figure S. The same as Figure|4]but now with sub-region 3 omitted from the calculation of Q z (si2, q, 0) for the Pope et al. (2004) sample. We have not 
plotted error bars. 



sub-regions 3 and 4. This sensitivity is quantified in Figure[3]which 
shows the scatter between the 14 2PCFs and 3PCFs used to con- 
struct the covariance matrices. The scatter in the 2PCFs between 12 
of the 14 jack-knife datasets, which contain the supercluster seen in 
Figure[2] is less than 10% on all scales probe d herein (s ^ 40 h~ 
Mpc) which is consistent with the findings of Ze havi et alj 1,2005). 
The 2 datasets which exclude sub-regions 3 and 4, have signifi- 
cantly different 2PCFs, up to 40% different on the largest scales, 
which is again consistent with lZehavi et alj 120051) who find that 
this supercluster greatly affects their 2PCF on large scales and is 
not accounted for by their estimates of the jack-knife errors. The 
effect on the 3PCF of the "Sloan Great Wall" is much greater. The 
jack-knife datasets that exclude sub-regions 3 and 4 (which con- 
tain the supercluster) differ by up to 70% (on large scales) com- 
pared to all other 3PCFs. 

In FigureQ we show the normalised 3PCF Q z for the whole 
dataset as well as for the datasets with sub-regions 3 and 4 ex- 
cluded. With the bulk of this supercluster excluded, the SDSS 3PCF 



values on l arge sc ales and is now in 



has much lower Q z (s,i 

good agreement with the lJing & Borned <2004l) 2dFGRS 3PCF on 
these large scales. This is also demonstrated in the error bars shown 
in FigureQwhich were estimated using all 14 jack-knife datasets 
(dot-dashed error bars) and for the 12 jack-knife datasets (solid er- 
ror bars) which excluded the supercluster (i.e., sub-regions 3 and 
4 removed). As expected, the sizes of these error bars are sensi- 
tive to the inclusion of the supercluster: if we exc lude the super- 
cluster , then our error bars are similar to those of Ijing &"B orner 
120041) . who assume an an alytical approximation for their errors. 
In addition, Jing & Borner (2004) used the 100k data release of 
the 2dFGRS and excluded areas of the 2dFGRS with R(0) < 0.1 
(areas with low reds hift completeness). As shown in Figure 15 of 
IColless et ail <200ll) . the northern strip of the 2dFGRS 100k data 
release has a large hole in its coverage between 12.5hrs and 13.5hrs 
in RA (due mainly to tilting constraints), which coin cides wit h sub- 
region 3 in Figure|2| Therefore, the sample used bv ljing & B orner 
(2004) does not include the main core of the "Sloan Great Wall" 
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and explains why our measurements of the 3PCF agree with theirs 
3PCF when we exclude sub-regi ons 3 & 4. 

| Baueh et alJ J2004h . lCroton et alJhoo4l . and lGaztafiaga et alJ 
1 2005) present an analysis of the higher-order correlation functions 
for the full 2dFGRS catalogue. In Figure 1 of Baugh et al. 1 2004), 
the " Sloan Great Wall" is visible in the NGP strip of the full 2dF- 
GRS. iBaugh et alJ 120041) also found that the presence of this su- 
percluster, and another in the 2dFGRS SGP area, significantly af- 
fected their measurement of the higher-order correlations on scales 
> A h~ Mpc, consis t ent wi th our findings in FiguresQand|3|(see 
also lGaztanaga et ail 120051) ). The influence of these superclusters 
on the higher-order correlation functions indicates that we have 
not yet reached a "fair sample" of the Universe with the 2dF- 
GRS and SDSS samples used herein. This was also examined by 
Hikag e et ail l2003h using the Minkowski Functions of the SDSS 
galaxy data (see their Fig. 8). 



5 DISCUSSION 

In Figure Q we find similar Q z (s,u, v) values for the two differ- 
ent samples discussed in Section [3] even though the Pope et al. 
sample probes ~ M* galaxies, while our volume-limited sample 
traces more lum inous galaxies at Mo. i r ^ —21. This co nfirms 
the findings of lKavo et all <2004l) and Ijing & BorneJ <2004 that 
there is no strong luminosity-depend ence in the Q z (s,u , v) pa- 
rameter (from -23 < Mo.i r < -19). ICroton et aljl2004l) also re- 
ports a weak luminosity dependence in the volume-averaged 3PCF, 
which could b e consistent with our m easurements given the error 
bars (see also Gaztanaga et al. 1 2005)). The lack of strong lumi- 
nosity dependence in 3PCF may be surprising given the strong 
luminosity depe ndence seen in th e 2dFGRS a nd SDSS 2PCFs 
iNorberg et ai]|200ll : IZehavi et alj|2005h . iKavo et alJ 120041) dis- 
cuss this behaviour further and conclude that galaxy bias must 
be complex on w e akly non-line a r to n o n-linear scales (but see 
IVerde et alJ <2002l) : ICroton et alJ 120041) : IGaztanaga etaU <2005l) 
for alternative interpretations). We will explore this weaker lumi- 
nosity dependence in future papers. 

Figure|4|presents the shape-dependence of Q z for the Pope et 
al. SDSS sample of galaxies, using the second of the two common 
conventions for Q z . Recall that this parametrization has Q z (s,q,8) 
with s — S12 being the shortest side of the redshift-space triangle, 
q ~ S23 /si2 and 9 the angle between S23 and S12 . Our choice of tri- 
angles is motivated by Figure 5 in the halo-m odel ICoorav & Shefhl 
120021) based analysis of Takada & Iain 1 2003), (although their anal- 
ysis was restricted to real-space rather than redshift-space trian- 
gles). To minimize overcrowding, we only show a subset of the 
error bars (the diagonal of the covariance matrix) on these data 
points. We also show the same error bars but with sub-regions 3 
and 4 omitted from the calculation of the covariance matrix. (Fig- 
uresQand[2|show that these two estimates of the error are similar 
on small scales but become significantly different on large scales.) 

On small scales (S12 < 2.5/i _1 Mpc), the shape of the nor- 
malised 3PCF is consistent (within the errors) for the different q 
values (see Figures|4]and[5j, and is close to a constant value (within 
the errors) as a function of 6, i.e., Q z (si2 < 2.5h~ Mpc) ~ 
0.75 ± 0.05. We see some evidence for a "U-shaped" behaviour 
in Q 2 on these small-sc ales, which is p redicted b y recent theoreti- 
cal mo dels of the 3PC F 1 Gaztanaga & Scoccimarro 2005). For ex- 
ample, Gaztanag a & Scocci marro 1 2005J) see a strong "U-shaped" 
pattern in Q z on small scales, i.e., in Figs 2 & 3 of their paper, 
they measure a factor of ~ 2 increase in both the Q z (9 ~ 5°) and 



Q z {9 ~ 175°) values, relative to the Q z (0 ~ 90°) values. We do 
not see as strong an effect as they clai m, but this could be due to 
our re latively coarse binning scheme as lGaztanaga & Scoccimarro! 
(2005) claim. We will explore this further in a future paper, with 
large datasets from the SDSS, but our Q z does have the same 
qualitative shape as they witnessed. We also note that our small- 
scale Q z measurements are in excellent agreem ent with the 2dF- 
GRS measurements of IGaztanaga et all 120051) . who also see the 
same weak "U-shaped" behavour (compared to simulations) and 
also have a near constant value of Q z (s\2 < 6h~ Mpc) ~ 0.75 
for their two different luminosity bins. This is remarkable agree- 
ment given the differences in the 2dFGRS and SDSS galaxy sur- 
veys. Finally, we comment that our values for Q z on small-scales 
are significantly smaller than the theoretical predictions for Q in 
real-space (which are Q ~ 3), but consistent with the expected 
decre ase in Q as one moves to redshift-space (see Figure 2 of 
IGaztanaga & Scoccimarrol (2005)). The value and shape of our Q z 
measurements are robust to the omission of the supercluster (see 
Figure|5J- 

The lack of any strong small-scale shape dependen ce of Q z is 
consistent w i th the 2dFGRS findings of ICroton et alJ <2004l) and 
Baugh et al. 1 2004), using volume-averaged 3PCFs. They found 
that the volume-averaged 3PCF scaled as, 



6( S )~S 3 £2( S ) , 



(3) 



where S3 displayed an weak luminosity dependence. Assuming 
little shape-dependence in Q z (s,q,6), then we can relate S3 to 
Q z by assuming the denominator in Eqn 1 of Q z simply becomes 
~3(6(s)) 2 ,andthusS 3 ~ 3 Q z . The value of S3 = 1.95 ±0.18 
derive d for L* gal axies in the 2dFGRS volume-averaged 3PCF 
iBaugh et al]l2004 is therefore in good agreement (within the er- 
rors) with our measured value of Q z ~ 0.75 on small scales for 
the Pope et al. sample (Figure |4j, which was designed to probe 
~ L* in the SDSS. This again demonstrates the relative insensitiv- 
ity of the 3PCF (in redshift-space) to the details of the selection of 
the galaxy sample. The simple scaling relationship given in Eqn|3l 
is expected for hierarchical structure formation models originating 
from Gaussian initial conditions 1 Peebles 1980; Baugh et alj2004l) . 

On larger scales (10 h^ 1 Mpc), the amplitude and shape- 
dependence of Q z changes significantly once the supercluster has 
been removed (comparing Figures |4] and For example, for the 
q — 2 triangle configurations (circle symbols), the "U-shape" in 
Q z is only seen once the core of the "Sloan Great Wall" has been 
removed. Likewise, "U-shape" behavour of Q z for the q — 3 tri- 
angle configurations (star symbols) is enhanced (by nearly a factor 
of 3) when the supercluster is removed, and is then in b etter agree- 
ment with the numerical simulations of Gaztanaga & Scoccimarro 
(2005) and measurements for the 2dFGRS ( Gazta naga et al] 2005). 
Therefore, the expected "U-shaped" signal in Q z due to filamen- 
tary stuctures in the Universe has been overwhelmed by the pres- 
ence of the supercluster, and is only seen when the "Sloan Great 
Wall" is removed. This indicates that the "Sloan Great Wall" has 
a different topology than filaments (e.g. sheet-like) or this differ- 
ence is caused by the orientation of this supercluster in the SDSS 
(it appears to be perpendicular to the line-of-sight). Overall, the 
3PCF is hard to measure on these large scales using the samples 
presented herein, and the errors are dominated by the "Sloan Great 
Wall". Larger samples, in both volume and numbers of galaxies, are 
required to explore the shape-dependence of the 3PCF in greater 
detail on these large scales, and that should be possible with future 
SDSS samples. 
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6 APPENDIX A: THE 3PCF DATA 

We present here the data points from Figures 4 & 5. We present 
the upper and lower limits of the bins used. We stress that these 
data are affected by large scale structures in the data and, therefore, 
should be used with caution. We present these data to aid in the 
comparison with other observations and theoretical predictions. 
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Table 1. The data presented in Figure 4 of this paper 
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Table 2. The data presented in Figure 5 of this paper 
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